Machine Learning for Online Algorithm Selection under Censored Feedback
نویسندگان
چکیده
In online algorithm selection (OAS), instances of an algorithmic problem class are presented to agent one after another, and the has quickly select a presumably best from fixed set candidate algorithms. For decision problems such as satisfiability (SAT), quality typically refers algorithm's runtime. As latter is known exhibit heavy-tail distribution, normally stopped when exceeding predefined upper time limit. consequence, machine learning methods used optimize strategy in data-driven manner need deal with right-censored samples, that received little attention literature so far. this work, we revisit multi-armed bandit algorithms for OAS discuss their capability dealing problem. Moreover, adapt them towards runtime-oriented losses, allowing partially censored data while keeping space- time-complexity independent horizon. extensive experimental evaluation on adapted version ASlib benchmark, demonstrate theoretically well-founded based Thompson sampling perform specifically strong improve comparison existing methods.
منابع مشابه
Online Learning under Delayed Feedback
Online learning with delayed feedback has received increasing attention recently due to its several applications in distributed, web-based learning problems. In this paper we provide a systematic study of the topic, and analyze the effect of delay on the regret of online learning algorithms. Somewhat surprisingly, it turns out that delay increases the regret in a multiplicative way in adversari...
متن کاملEfficient Online Learning under Ban- dit Feedback
In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlated arms. Particularly, we investigate the case when the expected rewards are a Lipschitz function of the arm and extend these results to bandits with arbitrary structure that is known to the decision maker. In these settings, we derive problem specific regret lower bounds and propose both an asymp...
متن کاملReinforcement Learning for Automatic Online Algorithm Selection - an Empirical Study
In this paper a reinforcement learning methodology for automatic online algorithm selection is introduced and empirically tested. It is applicable to automatic algorithm selection methods that predict the performance of each available algorithm and then pick the best one. The experiments confirm the usefulness of the methodology: using online data results in better performance. As in many onlin...
متن کاملModern Machine Learning for Automatic Optimization Algorithm Selection
Optimization software is commonly used to solve simulation-based problems such as optimal design and control, model parameter estimation, and best/worst-case scenario identification. While the value of such software is widely recognized, user feedback indicates that these tools are difficult for nonexperts to use. In particular, users are unfamiliar with the details of the plethora of available...
متن کاملMachine Learning for Subproblem Selection
Subproblem generation, solution, and recombination is a standard approach to combinatorial optimization problems. In many settings identifying suitable subproblems is itself a significant component of the technique. Such subproblems are often identified using a heuristic rule. Here we show how to use machine learning to make this identification. In particular we use a learned objective function...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i9.21279